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Abstract. Let (X, P x ) be a continuous time Markov chain with finite or countable state 
space S and let T be its first passage time in a subset D of 5*. It is well known that if /j, is 
a quasi-stationary distribution relatively to T, then this time is exponentially distributed 
under P M . However, quasi-stationarity is not a necessary condition. In this paper, we 
determine more general conditions on an initial distribution /i for T to be exponentially 
distributed under P M . We show in addition how quasi-stationary distributions can be 
expressed in terms of any initial law which makes the distribution of T exponential. We 
also study two examples in branching processes where exponentiality does imply quasi- 
stationarity. 



1. Introduction 

Let us denote by P(t) = {pij(t) : i,j G S}, t > the transition probability of a continuous 
time irreducible Markov chain X = {(Xt)t>o, (Pi)ies}i with finite or countable state space 
S and let Q = {qij : i,j G S} be the associated g-matrix, that is = p^-(0). We assume 
that Q is conservative, that is Yljes Qij = 0> f° r au 3 ^ S, and that X is not explosive. The 
transition probability (that will also be called the transition semigroup) of X satisfies the 
backward Kolmogorov's equation: 

ipij (* ) = Yl qikVk 3 (*) ■ ( 1 ■ 1 ) 

kes 

Let D C S be some domain and define the first passage time by X in D by, 

T = inf{t > : X t G D} . (1.2) 

This work aims at characterizing probability measures fi on E = S\D such that under P„, 
the time T is exponentially distributed, that is, there exists a > 0, such that: 

P M (T > t) = e~ at . (1.3) 

It is well known that when [i is a quasi-stationary distribution with respect to T, that is if 

F^Xt =i\T >t) = Hi, for all % G E and t > 0, (1.4) 

then (jl.3p . for some value a > 0, follows from a simple application of the Markov property, 
see |15| or [6] for example. Quasi-stationarity of (J, holds if and only if /i is a left eigenvector 
of the g-matrix of the process X killed at time T, associated to the eigenvalue —a, see 
|17| . However, quasi-stationarity is not necessary to obtain (jl.3p . Some examples of non 
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quasi-stationary distribution \x such that (jl.3p holds are given later on in this paper. 

Our work was first motivated by population dynamics, where it is often crucial to 
determine the extinction time of a population or the emergence time of a new mutant, 
see [21 |3l HU [18] for example. In many situations, those times can be represented as first 
passage times of Markov processes in some particular domain. Then it is often much 
easier to find an initial distribution, under which this first passage time is exponentially 
distributed than to compute its distribution under any initial conditions. 

Let us be more specific about applications to emergence times in biology which is the 
central preoccupation of the authors in |2j- Adaptation to a new environment passes by 
the emergence of new mutants. In adaptation theory, emergence can be described by 
the estimation of the fixation time of an allele in the population. We may also imagine 
a parasite infecting a resistant or new host, a pathogen evading chemical treatment, a 
cancer cell escaping from chemotherapy, etc. |10 | [TT | IT3" 1 I22|. An interesting and important 
point is to estimate the law of the time at which these new mutant individuals emerge 
in the population, for example to estimate the durability or the success probability of a 
new treatment or a new resistance. Emergence problem has already been considered in 
the setting of branching processes |21|. [22], for multitype Moran models in [7], and for 
competition processes, in [2|. In order to explain the latter case in more details, let us 
recall that a competition process is a continuous time Markov chain X = (X^\ . . . ,X^) 
with state space S = N d , for d > 2, whose transition probabilities only allow jumps to 
certain nearest neighbors. Competition processes where introduced by Reuter |20| as the 
natural extensions of birth and death processes and are often involved in epidemic models 
[H [TU1 113] , In [2J, the authors were interested in some estimation of the law of the first 
passage time T, when an individual of type r = 1, . . . , d first emerges from the population, 
that is 



T = M{t > : X\ r) = 1} . 



Then varying the birth, mutation, migration and death rates, some simulations of the 
law of the time T allow us to conclude that the consideration of interactions among two 
stochastic evolutionary forces, mutation and migration, can expand our understanding 
of the adaptation process at the population level. In particular, it shows under which 
conditions on mutation and migration rates, pathogen can adapt swiftly to a given 
multicomponent treatment. 

This paper is organized as follows. In section 2, we establish a general criterion for a 
measure [i to satisfy (j 1 . 3 [) and we study the connections between such measures and quasi- 
stationary or quasi-limiting distributions. Then, in the third section, we give some sufficient 
conditions for (|1.3|l involving the special structure of the chain on a partition of the state 
space E. In particular, Theorem [3] and its consequences allow us to provide some examples 
where exponentiality may hold without quasi-stationarity. An example of application in 
adaptation theory is provided in Subsection 13.21 The fourth section is devoted to the 
presentation of some examples in the setting of branching processes where exponentiality 
implies quasi-stationarity. 
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2. From exponentiality to quasi stationarity 
We first introduce the killed process at time T, as follows: 

T _ J X t , if t < T, , . 

Xt ~ \ A, if t >T, (2 ' 1} 

where A is a cemetery point. Then X T is a continuous time Markov chain which is valued 
in := E U {A}. Moreover if we define the killing rate by 

Vi = ^2 Qij , (2-2) 

jen 

then the g-matrix Q T = (qE) of X r is given by 

{qij, i,jeE 
qiA = Vii i€ E (2-3) 
<?Aj = , j e E A . 

From our assumptions, Q T is obviously conservative and X T is non explosive. In particular, 
Q T is the g-matrix of a unique transition probability that we will denote by P T (t) = 
(Pij(t))i,j&EA> t — ®> anc ^ w hich is expressed as 

( F l (X t =j,t<T), i£i,j€E, 
pg(i) = < Fi(t > T) , if iEE and j = A, (2.4) 

lj=A j if i = A and j £ Ea^ 

Then this semigroup inherits the Kolmogorov backward equation from ([Lift : 

Henceforth, all distributions i/ on £a that will be considered will not charge the state A, 
i.e. z^a = 0. In this section, we shall often consider initial distributions ^ = (fJ>i)ieE A f° r 
(Xf), on Ea satisfying the following differentiability condition: 

fiP T (t) is differentiable and ^-fiP T (t) = fi-^P T (t), t>0. (2.6) 

(Ail/ Ubfj 

We extend the family of probability (Pj)ieE to i = A, in accordance with the definition of 
(P T (t)) and for each f > 0, we define the probability distribution n(t) on Ea as follows: 

tM (t)=F^X? = i\T>t), ieE A . (2.7) 

We define the vector 5 by <5j = 0, if i 6 -B and 5a = !• 

Theorem 1. Lei [i be a distribution on E/\. 

(i) Assume that \x satisfies condition (|2.6p . then there is a > swc/i i/iai P^(T > t) = 
e~ at , for all t > i/ and only if 

H(t) is differentiable and fjf(t) = e at (fiQ T + a(ji - 5))P T (t), t > 0. (2.8) 

(n) Assume that there is a > such that F^(T > t) = e~ at , for all t > 0, then conditions 



(12.61) and (|2.8p are equivalent. 
(Hi) When (|2.8|) is satisfied, the rate a may be expressed as 

a = y~) Vim ■ (2.9) 

ieE 
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Proof. Note that the condition P^(T > t) = e~ at is equivalent to P /t (X t T = i,t < T) = 
e~ at fii(t). So since 

¥„{Xf =%) = F^X? = i, t < T) + I i=A P /1 (t > T) , 

the transition function P T (t) of X T satisfies 

fj,P T (t) = e~ at ii(t) + (1 - e~ at )5 . (2.10) 

Then from the differentiability condition (|2.6p . we see that is differentiable and from 
the Kolmogorov backward equation (j2.5|) . we obtain 

^P T (t) = -ae- at n(t) + e- at ii'{t) + ae- at 5 

= fj,Q T P T (t) . (2.11) 

Then from (pTOjl . we have e~ Q V(*) = /xP T (t) - (1 - e-°*)<5 and since 5P T (t) = 5, for all 
t > 0, we see that equation (|2.1ip may be expressed as 

H'{t) = e at (fiQ T + a(fi-8))P T (t), t>0. 

Conversely, if condition (|2.8|l is satisfied, then from (|2.6p . we can write equation (|2.1ip . 
Integrating this expression, we get ()2.10p which implies that P^(T > t) = e~ at , for all 
t > 0. The first assertion of the theorem is proved. 

Now if P M (T > i) = e -0 *, for all t > 0, then we have (|2~T0|) . so that if condition ([22]) is 
satisfied, then n{t) is differentiable and 

j t ^P T {t) = -ae- at n(t) + e- at ii'{t) + ae~ at 5 . (2.12) 

Moreover from the Kolmogorov backward equation and (|2.12p . we have fiQ T P T (t) = 
— ae~ at fi(t) + e~ at fi 1 (t) + ae~ at 5, which is (|2.8|) . The converse is easily derived from similar 
arguments, so the second assertion is proved. 
Then from equation (|2.8p . we obtain 

lira u'(t) = (nQ T + a(u - 5))P T (0) . (2.13) 

t — 5-0 

On the other hand, note that //a(*) = 0, for all t > 0, so that in particular fi& = /^a(0) = 
and limt_^o Ma (*) := MaW = 0- Finally, taking equality (|2.13p at A yields 

M<5a = ^2 V^Ta = ^2 = Ma(°) - "(MA -5 a ) = a, 

i€E A i£E 

which proves the third assertion of the theorem. 

□ 

Remarks 1. It is important to note that a distribution \i on £a may satisfy P^(T > t) = 
e~ at , t>0, whereas (j2.6|) does not hold. Examples can be constructed from Corollary [5] in 
the next section. 

2. When E is finite, condition (j2.6|) is clearly satisfied. In the infinite case, this condition 
may appear theoretical to some extend and sometimes difficult to check when not much is 
known on the transition probability. However it is possible to obtain quite simple conditions 
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implying (|2.6p . For instance, observe that from (|2.5p . for all i,j S Ea and t > 0, 

d 



dt 



< 



\qfkPkj( 

k£E A 



< Ei^i 

fee^A 

= -2?£. (2.14) 
A sufficient condition for (|2.6p to ZioW is i/ien 

^qifJ-i <oo, (2.15) 

where qi = —qj^. The later condition is satisfied in particular when the qi 's are bounded. 

Recall definition (|1.4p of quasi-stationarity. In our setting, it is equivalent to the following 
statement: a distribution [i on Ea, is quasi-stationary if 

Hi = Hi(t) , for all t > and i € Ea- (2.16) 

We will simply say that \x is a quasi-stationary distribution. Then, let us state the following 
classical result, already mentioned in the introduction. 

Theorem 2 (|19|). A distribution /i on Ea is quasi- stationary if and only if the equation 

/j,Q T = -an + a5 , (2.17) 

holds for some a > 0. (Note that (j2. 17|) is equivalent to fiQj = —afii, for all i € E.) 

In |19| it is proved that (|2.17p is equivalent to the fact that P T satisfies the Kolmogorov 
forward equation, which is the case under our assumptions, that is 

J t Pl^ = S>ffc(*)9ki- ( 2 - 18 ) 

keE 

Knowing condition (|2.6p , it is actually quite easy to prove Theorem [2j Note also that under 
this assumption, Theorem [2] is a consequence of Theorem [TJ As a consequence of both these 
theorems we also obtain that (|2,6p holds whenever fi is quasi-stationary. 

Corollary 1. If fx is a quasi- stationary distribution then condition (|2.6p holds. 

Proof. If fi is quasi-stationary, then it follows from (j2. 16|) and the Markov property that 
P^(T > t) = e~ at , for some a > (this fact is well known, see [6], for instance). Moreover 
the function (i(t) is differentiable and n'{t) = 0, for all t > 0. On the other hand, from 
Theorem [2J equation (|2.17p holds. Therefore, condition fj2 . 8|) holds, so that (|2.6[) is satisfied 
from part (ii) of Theorem [TJ □ 

A distribution tt on Ea is called the quasi-limiting distribution (or the Yaglom limit) of 
a distribution /i on Ea, if it satisfies 

lim FJxT = i\T >t) = , for all i G E A . (2.19) 

i— >oo 

Then a well known result asserts that any quasi-limiting distribution is also a quasi- 
stationary distribution, see for example |15|. [6| IT6] . Recall also that if ir is the quasi-limiting 
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distribution of some distribution fj,, then the rate a satisfying (jl,3p is given by the expression 
a = inf|a>0:^ e at Pi{T > t) dt = ooj > , (2.20) 

which does not depend on the state i G E. As an application of Theorem [1] and the above 
remarks, we show in the next corollary how to construct quasi-stationary distributions from 
distributions satisfying (|l,3p . 

Corollary 2. Let fi be a distribution on such that P M (T > t) = e~ at , t > 0, for some 
a > and satisfying (j2.6j) . If [i admits a quasi-limiting distribution, ir, then the later is 
given by: 



7T = fl 



poo 

+ / (/iQ T + a( f i-5))e at P T {t)dt, 
Jo 



where (fj,Q T + a(fi — 5))e at P T (t) dt should be understood as a possibly improper integral. 
In particular, ir is a quasi- stationary distribution on 

Proof. Under these assumptions, it follows from Theorem [1] that for all t > 0, fjf(t) = 
e at (/j,Q T + a(fi — 5))P T (t). Moreover, since P M (T > 0) = 1, fi(t) is continuous at and 
n(0) = fi, so that 

(i(t)-fjL= [ {fiQ T + a(n-5))e au P T {u)du. 
Jo 

Since fi(t) converges to a proper distribution /u, as t tends to oo, it follows that the improper 
integral J °° (fJ-Q T + a(fi — 5))e au P T \u)du = lim i _ i . +00 J Q (fiQ T + a(/i — 5))e au P T \u)du exists 
and is finite. The fact that jjl is a quasi-stationary distribution follows from the results 
which are recalled before the statement of the corollary. □ 

Corollary [2] may be interpreted as follows: if [i is such that T is exponentially distributed 
under P^ and admits a Yaglom limit, then the correction term which allows us to obtain a 
quasi-stationary distribution from /j, is j^°(/j,Q T + a(fi — 5))e at P T (t) dt. 

The next result shows that whenever there exists a non quasi-stationary distribution 
which makes the time T exponentially distributed, then under some conditions, we may 
construct a whole family of distributions having the same property. 

Proposition 1. Let ji be a distribution on E& satisfying (|2.6p and such that P^(T > t) = 
e~ at , t > 0, for some a > 0. For n > 1, let us denote by q™j T the entries of (Q T ) n and let 

us define the vector (/4 )ie-E A j by 



(n) _ (-1)" ,,.„ n > T o a T? ,» 



3 



a' 



£/^ T , ^A=0. (2.21) 



i€E A 

If for all j G E, 

0<(-l) n ^Mf <«", (2-22) 

then (/i| n ^)j g £; A is a distribution on E& which satisfies P ( n )(T > t) = e~ at , for all t > 0. 
Proof. The assumption P M (T > t) = e~ at is equivalent to 

^2^. P J A (t) = l-e- at . (2.23) 
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Using condition (|2 . 6|) and the Kolmogorov backward equation (|2,5p . we obtain by differen- 
tiating the latter equality 

i£E \j£E A J 

Decomposing the left hand side and using (|2.3p and (|2.9p . we obtain 

-at 



ae 



which gives 



I>Ja(*) ( — J^Mfj ) = 1 - e - 

jeE \ i€E / 



at 



(2.24) 

Then from condition (|2.22p , we may let t tend to oo in (|2.24p , in order to obtain by monotone 
convergence that YljeE = 1> so that 

i£E 

is a distribution on Moreover (|2.24p is equation ()2,23p where we have replaced (i by 
//W, so that fi^ satisfies P (i)(T > *) = e~ at . 

The result is proved for n = 1. Then the proof is completed by iterating these arguments. 

□ 

Let /x be a distribution on such that P«(T > t) = e _Q?t , i > 0, for some a > 0. As we 
have already observed, if sup iGE qi < a, where % := — <?^, then condition (|2.6p is satisfied, 
but moreover, it is easy to check that for all n > 1 and for all j £ E, 

{-IT Y, W/If <« n , (2.25) 

which provides the second inequality in (|2.2ip . An interesting problem is then to determine 
simple conditions insuring the first inequality in (|2.2ip . that is nonnegativity of the term 

(-i) n £ mf- 

Corollary 3. Let fj, be a distribution on satisfying (|2.6|) and such that P At (T > t) = e~ at , 
t >0, for some a > 0. Define as in (|2.2ip . Then, 

1. if for some n > 1, is a quasi-stationary distribution, then = fj,( k+1 \ for all 
k > n. 

2. If the sequence of distributions /ttW converges, as n — > oo ; toward a proper distribu- 
tion /U^ 00 -*, then is a quasi-stationary distribution. 

Proof. The proof of the first part simply follows from the identity: 

^ +1) = -^Qf, iEE, 
a 
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and Theorem [2j The second assertion is a consequence of the same observation, which 
leads, under the assumption to, fA°°^Qf = — ad , i £ E. Then we conclude by applying 
Theorem [2j □ 

In particular if the state space E& is finite and — Q T is an honest matrix then from Perron- 
Frobenius theorem, assertion 2. of Corollary [3] is satisfied for the maximal eigenvalue a of 
-Q. 

3. Sufficient conditions for exponentiality. 

3.1. General results. Let us keep the notation of the previous sections. The next the- 
orem provides sufficient conditions for a distribution fj, to insure that T is exponentially 
distributed under P„. This result together with its Corollary [J] allow us to construct exam- 
ples for which such distributions exist. 

Theorem 3. Let {E\, E2, ■ ■ ■ } be a finite or infinite partition of S containing at least two 
elements and with E\ = D {in particular {E2,E^, . . . } is a partition of E). Assume that ji 
is a distribution with support in E that satisfies the following condition: 

(i) For all k > 2 and I > 1, with k ^ I and for all i € E k , the quantity X^'e-E; Qij does 
not depend on i. For i E E k , we set 

Then the following two conditions are equivalent. 

{ii) For all k > 1, the quantity P^PQ £ Ef. | T > t) does not depend on t > 0. More 
specifically, we have, 

F^Xt e E k I T > t) = p. k , t > , (3.2) 

where Jl k = Yl Mi- 

i£E k 

{Hi) There exists a > 0, such that 

p,Q = —aft + ad , 

where Q = (q~ki)k,l>i, Qik = 0, for k > 1, q kk = - Y,i>i Qkh for k > 1, p, = {fik)k>i 
andd= (1,0,0,...). 

Moreover, if conditions {i) and {ii) (or equivalently conditions {i) and {Hi)) are satisfied, 
then T is exponentially distributed under F^, with parameter a given by 

a = '^2q k iji k . (3.3) 

fc>i 

Proof. Let (lt)t>o be the continuous time process with values in {1, 2, . . . } which is defined 
by Y t = k, if X t G E k , that is 



Y t = J2kl {Xt eE k }, t>0. 



k>l 

Observe that T = inf{t : Yt = 1}. Then we will first show that under assumption (i), the 
absorbed process 

y T _( Y t , if t <T, , . 

is a continuous time Markov chain with q-matrix Q = {q k i) k ,i>i, as defined in {Hi). 
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Let us first assume that the number of sets in the partition (E k ) is finite and is n > 2. 
Recall from Section [21 the definition of {pjj(t) : i,j G S} and {qj- : i,j G S}, the transition 
probability and the q- matrix of the Markov chain X, killed at T. For all t > 0, define 



ku(t) :-- 



¥i{X t EE l ,t<T) = £ jei3; pg(t) , if i 6 E A and 2 < Z < n, 
Ff A (t), ii ieE A and 1 = 1. 



From the corollary, on page 132 in [5], the function t h-> is differentiable and for 

2 < I < n, 

Applying the Kolmogorov forward equation (|2.18p . we obtain, for 1 < I < n, 

fakaQ) = X X Pimtttemj 
jeE t meE 
n 

= E E E q ™i 

x=2m£E x j&Ei 
n 

= X X Pimi^Qxl 
x=2 m£E x 
n n 

= E3 = X ^ (*)* Ei ■ 

x=2 x=l 

Let ii G . . . ,i n G S n . Then from the above equality, the function K(t) = {ki i(t) : 1 < 
j, I < n} satisfies 

j t K{t) = K(t)Q , t>0, with K{0) =Id. (3.6) 

Since n < oo, the solution of (|3.6p is unique, this shows that for all I > 1, the functions 
t l— >• does not depend on i £ E^. Then we set 

Pfcz(t) = hl(t) , for 1 < k, I < n and i G 
and -P(t) = {pkl(t) 1 < k,l < n}, t > 0. Form (|3.6|) . we can write 

j t P{t) = P(t)Q , t > , with P(0) = /d . (3.7) 

Then let us check that Y T is a Markov chain with transition semigroup P(t), t > and 
(/-matrix Q. For all k±,..., k n , k n+ \ > 2, < s\ < S2 < ■ ■ ■ < Sn+i an d for any measure v 
on E, 

MY£ +1 = k n+ i \Yl = k u ...,Yl = k n ) 
= ^v{X Sn+l G E kri+1 , s n+ \ < T | X Sl G , . . . , X Sn <G Ek n ,s n <T) 

^( 1 {X Sl £E kl ,...,X an &E hn ,s n <T}^X a „(X Sn+1 - Sn G E kn+1 ,S n+ i - S n < T)) 

Wv(X Sl G E kl , . . . , X Sn G £'fc n ,s n <T) 

= Pknkn+i ( s n+l ~ s n) ■ 

The last identity follows from the fact that for any i G Ek n , 

kik n+1 (s n+1 - s n ) = Pi(X Sn+1 - Sn G E kn+1 ,s n+ i - s n <T) = pk n k n+1 (s n +i ~ s n ) , 
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which has been proved above. The case where k±, . . . , k n , k n+ \ may possibly be equal to 1, 
is proved similarly. 

Then recall from (Hi), the definition of the measure /i on fik = ^2i£E k Z 2 *' 

k > 2 and Ji\ = 0. Let (Pfe)fc>i be the family of probability laws associated to the Markov 
process {Yt)t>0- F° r all /c = 2, . . . , n, 

F„(X t 6 E k ,t < T) = J2 eE k ,t< T) 

i&E 
n 

1=2 ieE t 
n 

1=2 

= ffi(Y t = k,t<T), (3.8) 

where the third equality follows from the fact that ¥i(Xt € E)~, t < T) = ¥i(Yt = k,t < T), 
for all i £ E[. Assume that condition (ii) holds, then we derive from (|3.2p and (|3.8p that 
for all k = 2, . . . , n, 

f n (Y t = k\t<T)=j2 k , 

which means that fi is a quasi stationary distribution with respect to the lifetime of the 
Markov process Y T . In particular, thanks to Theorem^ (ii) and (Hi) are equivalent. More- 
over, since (|2.8|) in Theorem Q] is satisfied, then from (Hi) in this theorem, T is exponentially 
distributed under Pn, with parameter a = ^^^Qkifik- We conclude from equality (|3.8p 
which shows that F^t < T) = f p (t < T). 

Now assume that the number of elements in the partition (E k ) is countable. For all 

n > 1, define the set E^ = E x U E n+1 U E n+2 U . . . and set T n = mi{t : X t G E^}. Let 
us consider the process y( n ) defined by 

n 

Y t n) = 1 ( x^h + E kl {^ k} , t>0. 



{x t eE\ n >} 1 

fe=2 



Then we have T n = inf{i : Y^ = 1}, and the absorbed process 

v (n),T n _ ( Y+ \ if t < T n , /r, Q \ 

* "I i, ift>r B> (3 - 9) 

is a continuous time Markov chain. Indeed, the Markov chain X satisfies assumption (ii) 
of the theorem with respect to the partition {E^ , E 2 , ■ ■ ■ , E n } of S. Therefore, from what 
has been proved before, y( n )> T ™ is a continuous time Markov chain. Moreover, we easily 
check that y( n )> T » converges in law in the sense of finite dimensional distributions to Y T 
and that the later has Q for (/-matrix. Then the equivalence between (ii) and (Hi) follows 
in the same way as in the finite case. Moreover, when (i) and (ii) are satisfied then we 
conclude that T is exponentially distributed with parameter a = X^fc>i Qkifik m the same 
way as in the finite case. □ 

In the particular case where the partition {E 2 , E$, . . . } of E is reduced to the singletons of 
E, then condition (ii) is obviously satisfied and condition (i) simply means that fi is quasi- 
stationary with respect to T, hence the conclusion follows from Theorem [2j Conversely, by 
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considering {E} as a partition of E, we obtain the following consequence of Theorem [3j 
Recall the definition of r/i in (|2 . 2[) . 

Corollary 4. // there exists a > such that r]i = a, for all i G E, then the first passage 
time T has an exponential distribution with parameter a under F^, for all initial distribution 
\jl with support in E. 

Proof. We apply Theorem [3] with the finite partition {E\,E2} of S, given by E\ = D 
and E2 = E. Then we see that part (i) of Theorem [3] is precisely the assumption of the 
present corollary. Moreover, part (ii) simply follows from the form of the fact for all i G E2, 
Pi(Xj G E2, t < T) = Pj(t < T), in this particular case. Then we conclude from Theorem [3] 

□ 

Remarks 1. From Corollary® it becomes clear that there exist many instances of con- 
tinuous time Markov chains for which we can find initial distributions [i such that T is 
exponentially distributed under P^, whereas (|2.6p is not satisfied. 

2. Under the assumption of this corollary, if moreover there is a Yaglom limit, fj,, as recalled 
in the previous section, then fi is explicitly given by 

/•oo 

M = 1 W + / (1 W Q T + - 5))e at P T (t) dt . 

Jo 

In particular, this expression does not depend on i. 

3. In the case where S is finite, it is stated in Proposition 2.1, (ii) of [6] that if the sum of 
the rows of the matrix (qij)ij£E are constant, then T is exponentially distributed under P^, 
for all probability measure [i. This result is a consequence of Corollary [3] since when the 
sum of the rows of the matrix (qij)ij^E are constant, the rates r/i are constant. 

Actually it is always possible to compare the distribution of T with the exponential law, 
as Proposition 1 shows. Let us first define the continuous time Markov chain X by removing 
D from the state space of X. More specifically, the state space of X is E and its g-matrix is 
given by q(i,j) = q{i,j), if i^j and i,j G E and q(i,i) = - J2jeE,i^j for a11 i G E - 

We denote by {Pj : i G E} the family of probability distributions associated to X. Then 
we may check that {(Xt)t>o, (Pj)ig£;} is a continuous time irreducible and non explosive 
Markov chain on E. We first establish the following lemma which presents a decomposition 
of the first passage time T. 

Lemma 1. Let {e(i) : i G E} be a family of random variables which is independent of 
the process {(Xt)t>o, (Pj)ig£;}. We suppose that each random variable e(i) is exponentially 
distributed with parameter T]i. Let (T n ) n >% be the sequence of jump times of X , set Tq = 0, 
define I n = T n — T n _i, n > 1 and 

00 

f = J2(T n + e(X Tn ))ln n , (3.10) 

n=0 

where Q n = {e{X To ) > h, . . . , e(X Tn _ 1 ) > I n ,e(X Tn ) < I n +i}, for n > 1 and Q = 
{e(Xo) < Ii}. Then for all i G E, we have the identity in law 

[(X u ,u<T),¥ l ] = [(X u ,u<f),Fi]. 

Proof. This result is a direct consequence of the general structure of continuous time Markov 
chains. Indeed, it suffices to observe that when the process X is in a state i G E, the 
waiting time for hitting D is independent from the past and exponentially distributed, with 
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parameter rji. Hence it is clear that before time T, the process X behaves like a process in 
E, i.e. X killed at a time T, whose decomposition is given by equation (|3.10p . □ 

The following result provides exponential bounds for the distribution function of the first 
passage time. 

Proposition 2. Define the rates a$ = infjg^rji and ct\ = sup i£E rji, where rji is defined in 
(|2.2p . Then the tail distribution of the first passage time T satisfies the inequalities: 



e -ait < p.^ <T) < e- aot , (3.11) 

for all t > and for all i G E. 

Proof. It follows from Lemma [T] that for all £ G E and t > 0, 

Pi(t < T) (3.12) 
= Pi(t<T) 

oo 

= E E Ht<T n + e(X Tn ),n n \X To =i ,...,X Tn =i n )x 

n=0i ,...,i n £E 

¥i(X To =io,...,X Tn =i n ), (3.13) 

where iq = i. Then, from the Markov property and the assumption on the random variables 
{e(i) : % € E} in Lemma [TJ under Pj, conditionally on {Xt = io, ■ ■ ■ ,Xt u = i n }, the 
random variables I±, . . . , I n+ i,e(io), . . . , e(i n ) are independent. So with I = (1%, . . . , I n +i) 
and x = (xi, . . . , x n+ i), one has 

Fi(t <T n + e(X Tn ), n n | X To =i ,...,X Tn = i n ) (3.14) 
Pi (J G dx)Pi(£(i ) > cci, . . . ,e(i n -i) > x n ,t - t n < e(i n ) < x n+1 ) , 



where t n = X\ + • • • + x n . The integrand in the above integral may be written as 

i(e(i ) > xi,e{h) > x 2 , . . . , > x n , t - i n < e(i n ) < z n+1 ) (3.15) 



e 



??(io)^i -ri{ii)x 2 i)a;„/ e — ?7(i n )[(t-t„)V0] _ g-r?(i n )a:n+l , \][ 



where for n = 0, the term e W ) 311 *7(*i)e2 - ^(in-Oa™ j g understood to be 1. Hence by 
applying successively (|3. 15|) . (I3.14p and (|3.13p . it follows 



\(t < T) 

oo 

e 

\n=Q 



i,(x5)Ji tjC-x-;,!)/™ 



e -»?(x;)[(t-r n )vo] _ e -J7(x;)/ n+1 



l{t<T„ +1 } , 



where for each n, {X n : n > 0} is a sequence of random variables which has the same law 
as {X n :> 0} under P x and which is independent of the sequence {T n : n > 0}. Let be 
the (random) index such that T\. < t < T^+i, then we easily check that the term which is 
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in the expectation of the right hand side of the above equality is 
-r,(X' )h »7(^;_i)/, 



oo 
n=0 

e -v(x' )h 

oo 



e -ij(x;)[(t-T n )vo] _ e - v (x' n )i n+1 



- v {X' k ){t-T k ) _ n(x' k )i k ^ 



%<T n+1 } 



+ ^ e-^o)^— -^-i)^ i _ e -^;)/n+i 

n=fc+l 

= e -n{X' )h rKX^J/fcg-rXA^Xt-Tfc) _ 

But from the assumption, we have rj(X' n ) > ao, a.s., for each n > 0, so that it follows 
e -^W)Ji-...-i,(xi_ 1 )/fc e -^(jri)(t-2i) < e -«o(/i+-+4) e -«o(i-T fc ) = e - ao t j as . which gives the 

second inequality in (|3.1ip . 

We obtain the first inequality in (|3,lip by bounding from above the term Pj(T < i). 
More precisely, we show that Pj(T < t) = 1 - Pj(i < T) < e~ Ql< in the same way as we 
obtained the other bound. □ 

Note that Corollary 3] may also be obtained as a direct consequence on Proposition 1. It 
suffices to assume that ao = a i in this proposition. 

3.2. Application to the emergence time of a mutant escaping treatment. Let us 

consider the case of a pathogen population living on a host population. At each time t, the 
whole host population is either treated or not. The mutation rate depends on the pathogen 
population size. Since treatment reduces the pathogen population size, we assume that 
the mutation rate takes two different values, according to the presence or absence of the 
treatment. Then, the dynamics of the pathogen population size is described as a Markov 
chain X whose state space S is split up in three parts, that is S = E\ U Ei U E% , with : 

. E\ , the set of values of the pathogen population size when the population contains 
at least one mutant, 

. E2, the set of values of the pathogen population size when the population contains 

no mutants and its size is less than a given value K, 
. E3, the set of values of the pathogen population size when the population contains 

no mutants and its size is greater than K. 

The set E<i corresponds to the presence of treatment and £3 corresponds to its absence. 
Recall the notation of Theorem [3j The transition rates between E2 and E3 depend only 
on the treatment strategy and are considered to be constant equal to (J23 and 532. We 
also note the two transition rates to E\, that is the mutations rates, by 521 and (731. The 
emergence time is then defined as T = inf{£ > : Xt € E\}. 

Let us consider a treatment strategy ensuring that the probability for the pathogen 
population size to be less than K, before a mutation occurs, is ^(£2) = P>2- Similarly, the 
probability for the size to be greater than K, before mutation, is n(E^) = /I3 = l—p,2- From 
Theorem [3] T is exponentially distributed with parameter a > 0, if ji solves the equation : 

jlQ T = —ap, , (3.16) 

with 

iT / — Q23 — Q21 Q23 



Q32 -Q32 — Q31 
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Set a = + and 

921 - 931 + 923 + 932 - \J fel ~ 931 + 923 ~ 932) 2 + 4g 2 3<?32 

/^2 - O/"- ' 

2(921 - 93i) 

then we can check that p, = (^2,^3) is a solution of (|3.16p . so that with this choice for a 
and p, the time T is exponentially distributed with parameter a > 0. 

From a biological point of view, these results may be interpreted and used as follows. 
The rate p2 represents the proportion of time during which the host population has been 
treated after some time. From this proportion of time, we can determine the emergence 
time of a mutant pathogen. Then a treatment strategy can be designed. 

4. When exponentiality implies quasi-stationarity 

In this section, we present two examples where exponentiality implies quasi-stationarity. 
The first one concerns extinction times in branching processes and in the second one we set 
out the case of the emergence time of a new type in a multitype branching process. 

4.1. Extinction time in branching processes. Let (Z t , t > 0) be an irreducible critical 
or subcritical continuous time branching process on the set S = {0, 1, . . . } of nonnegative 
integers, as it is defined in Chapter III of [1]. Set D = {0} and recall the definition of the 
first passage time: 

T = inf {t : Z t = 0} , 

which is an absorbtion time in the present case. Then it is well known that under these 
assumptions, Pfc(T < oo) = 1, for all k > 1 and from the branching property, we have for 
all k E E = {1,2,...} and all t > 0, 

F k (T<t) = F k (Z t = 0) 

= pP!(Z t = 0)] fc . (4.1) 

Let us set qt = f\(Zt = 0), then from (|4.ip . for any probability measure fj, on E, the 
quantity P«(^ = 0) corresponds to the generating function of [i evaluated at the point 
qt, that is 

oo 

P M (Z t = 0) = £> fc (Z t = 0)M(fc) 

k=0 

oo 

= E «?/»(*) 

k=0 

= G^qt). (4.2) 

Note that q± is a non decreasing function. Let q^ 1 be its right continuous inverse, i.e. 
q^ 1 = mf{s : q s > t}. Then from (|4.ip and (|4,2p . for T to be exponentially distributed 
under with parameter a > 0, we should have for all t 6 [0, 1): 

G M (t) = 1 - e"^ 1 . (4.3) 

Equation (|4.3p shows that for each a > 0, there is exactly one measure (M = fj, a , with support 
on E such that T is exponentially distributed with parameter a, under P^. According 
to Theorem 3.1 in |14j . that is easily extended to discrete state space, continuous time 
branching processes, if the Malthusian parameter p of {Zt, t > 0), i.e. Ei(Zt) = e~ pt , is 
strictly positive, then for a < p, fj, a is a quasi-stationary distribution, whereas it is not in 
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the case where a > p. When p = there is no quasi-stationary distribution associated to 
T. 

Observe that sometimes the generating function G^ a may be derived explicitly For 
instance in [TJ, p. 109, it is proved that if Z is a birth and death process with rates A n = nX 
and v n = np, when the process is in state n, then 

ve {y-\)t _ v 

* = ve {u-\)t _ X ' 

With qt 1 = In (^30 ~, we obtain, 

G -« = 1 -(^))^' 

which allows us to recover the generating function of the unique quasi stationary distribution 
associated to T and a. In the case of continuous state branching processes, a similar 
expression for the Laplace transform of p a has been obtained in |14j . see p. 438 therein. 

4.2. Emergence time in multitype branching processes. Let Z = (Z^\ . . . , Z^) be 
a d-type, irreducible, supercritical branching process, see Section V.7 in [TJ . The state space 
of Z is then S = {0, 1, . . . } d . Let v = {v^\ . . . , i/W) be the offspring distribution of Z. We 
assume that for all i = 1, . . . , d, 

z^ (i) (0) = 0, 

so that, each individual has at least one child, with probability 1. In particular, the extinc- 
tion time is almost surely infinite. Then we are concerned with the emergence time of the 
subpopulation of type d, that is: 

T = ml{t : Z[ d) > 1} . 

For k = (k\, ■ ■ ■ , kd) € S, let Pk be the probability under which Z starts from k, i.e. 
Zo = k, Pfc-a.s. It is not difficult to check that under our assumptions, Pk(T < oo) = 1, 
for all k G S \ {0}. In the present case, the domain in which T is the first passage time 
is D = S \ ({0, 1, . . . j^ 1 x {0}), that is E = {0, 1, . . . j^" 1 x {0}. Let e, e 5 be defined 
as ei = (1, 0, . . . , 0, . . . , 0), e2 = (0, 1, . . . , 0, . . . , 0),... Then we derive from the branching 
property of Z that for all k € E, 

d-i 

P k (T > t) = Yl P ei {T > t) kl . (4.4) 

i=l 

(Recall that = 0, when k € E.) Let /ibea distribution whose support is included in E. 
Recall that the (multidimentional) generating function of p is defined by 

G M (t) = ^tf 1 ...t*^(k), fort = (t 1 ,...,t d )G[0,l) d . (4.5) 

Let us set q ijt = P Gi (T > t) and for ke E, let q t = (gi jt , . . . , q d)t ) and = q^ t x • • • x = 
x • • • x q d tit (recall that q^t = 1)- Then we derive from (j4.4|) and (j4.5|) that 

P M (T>t) = ^q^(k), t€[0,l), 

= G M (q t ). (4.6) 
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Let us assume that d = 2. Then in this case, the state space of the process Z killed at 
time T can be identified to {0, 1, . . . }, so we write P/%, k £ {0, 1, . . . } for the probability 
distributions associated to the killed process. Similarly to the situation presented in 14.11 
for each a > 0, the distribution fi a such that P„ a (T > t) = e~ at , is uniquely determined 
by equation (|4.6p . Indeed, if is the right continuous inverse of q\^ = Pi(T > t), then 

G^(t) = e~ a ^ , t>0. 

An open question is then to determine whether if \i a is quasi-stationary or not. According 
to Theorem[2j it amounts to check if fi a satisfies the equation fi a Q T = —U[i a + a5. Besides, 
from Theorem 1.1 in [8] this can be determined at least for the rate a which is defined in 
flZSID , that is 

a = inf | a > : J e at Pi{T > t) dt = oo J . 

Then provided a is strictly positive and lim/ c _ ) . 0O Pfc(T < t) = 0, for all t > (the 
latter condition is clearly satisfied in our case) the distribution fM a is a quasi-stationary 
distribution associated to the rate a. 

Remarks 1. In the case where S is a finite set, another example where exponentiality 
implies quasi- stationarity is given in part (iii) of Proposition 2.1 o/[B]. The Markov chain 
that is considered in this work is a random walk in the finite set {0, 1, . . . , iV} killed at 0. 
2. In the case of continuous state space Markov processes, other examples where exponen- 
tiality implies quasi- stationarity have been emphasized in [9] . In this work it is proved that 
if the absorbtion time of a positive selfsimilar Markov process is exponentially distributed 
under some initial distribution, then the latter is necessarily quasi- stationary. 
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